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Abstract 

We prove a version of the stochastic maximum principle, in the sense of Pontryagin, for 
the finite horizon optimal control of a stochastic partial differential equation driven by an 
infinite dimensional additive noise. In particular we treat the case in which the non-linear 
term is of Nemytskii type, dissipative and with polynomial growth. The performance 
functional to be optimized is fairly general and may depend on point evaluation of the 
controlled equation. The results can be applied to a large class of non-linear parabolic 
equations such as reaction-diffusion equations. 


1 Introduction 


In the framework of classical optimal control theory, the maximum principle in the sense 
of Pontryagin is generally understood as a necessary condition for optimality of a control 
and its associated trajectory. Under some additional assumptions, the conditions may also 
become sufficient to ensure optimality. In the context of stochastic systems, a very general 
formulation of the maximum principle was obtained by Peng in , in the case of a controlled 
finite dimensional equation driven by the Brownian motion. This work was later extended 
in several directions by many authors. Here we pay particular attention to the case of 
infinite dimensional controlled systems, in particular controlled stochastic partial differential 
equations (SPDEs), on a finite time horizon. 

The hrst result in infinite dimensions is due to Bensoussan in the case of a convex set 


of control actions and a possibly infinite-dimensional Brownian noise. Later in 17j the case 


of diffusion coefficient unaffected by the control parameter was treated. Several versions of 
the stochastic maximum princi ple for general evolution equations were proved in 2^, other 


versions can be found in 13]. m and [l9|, under various conditions. The results in |l4l |. 


15l | can be directly applied to a large class of concrete controlled SPDEs of parabolic type, 
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but only in the case of a finite-dimensional Brownian noise. It is worth emphasizing that 
the general case (in which the control parameter enters the diffusion term, the set of control 
actions is general -in particular non convex- and the noise is infinite-dimensional) is still an 
open problem. 

The aim of this work is to prove a stochastic maximum principle for a class of controlled 
semilinear SPDEs of reaction-diffusion type written formally as 

^((.{) = AX{t,0 + /{X{t,(),u(t)) + B^(t, 

' X{0,() = 

= 0 on do, 



where ^ € O a bounded subset of and t G [0, T] for some hxed T > 0. u{-) is the control 
process, taking values in a general space of control actions U, A is an operator of elliptic type, 
xq a given initial condition, d'^w/d^dt is a symbolic notation for a space-time white noise, B 
specifies the covariance operator of the noise, which enters the equation in an additive way, 
and finally f : M x U —)-]Risa given function. Thus, the non-linear part of the drift is the 
so-called Nemytskii operator (or superposition operator) associated to /. 

In the context of the general theory of SPDEs, the well-posedness of this type of equation 
(for a given control 'u(-)) is known under very general conditions on the function /, see for 
instance [1] for a systematic exposition. Particular attention is payed to the case when / 
is decreasing, which corresponds to a dissipativity assumption on the non-linear part of the 
drift, but it is otherwise very general, in particular it does not satisfy any Lipschitz condition. 
Most of the existing results were proved with the primary aim of studying the asymptotic 
behavior of an uncontrolled SPDE. In dissipativity assumptions are required on the drift 
in connection with optimal control of an SPDE, with the aim to prove existence of relaxed 
optimal controls. The analysis is often carried out by re-writing the SDPE as an abstract 
evolution equation whose trajectories lie in the space H := Lp‘{0) or, under appropriate 
conditions, in the space E := C{0). 

On the contrary all of the previously mentioned references on the stochastic maximum 
principle require the function / to be at least Lipschitz continuous. It is our purpose to 
prove a version of the stochastic maximum principle by imposing to the nonlinear term only 
dissipativity and polynomial growth conditions, along with some smoothness property. Even 
for finite dimensional controlled stochastic equations, this seems to be an issue that was 
considered only very recently: see 211]. On the other hand, we limit ourselves to the case 


where the noise is additive and uncontrolled. 

In order to achieve gretaer generality we formulate our results for an abstract stochastic 
controlled evolution equation with additive noise in the Banach space E, of the form 


r dX{t) = [AX{t) + F{X{t),u{t))] dt + BdW{t), 

{x{0)=xo. 

We assume that ^ is a dissipative sectorial operator in E, and thus the the generator of an 
analytic contraction semigroup of linear bounded operators S{t), t > 0. We assume that S 
extends to a semigroup on H satisfying in particular the regularizing property S{t){H) C E 
for t > 0, as well as other assumptions. The non-linear term E : E —)• E is of Nemytskii 
type and it is dissipative in E. The noise IT is a cylindrical Wiener process in another 
separable Hilbert space K and B : K ^ H is a, linear bounded operator. Under suitable 
assumptions on B we can guarantee that the stochastic convolution Jq S(t — s)BdW(s), 
t G [0, T], admits an E-continuous version and therefore by knonw results (see e.g. 0 ) that 
the state equation admits a unique mild solution in E for any admissible control. We take 
special care in verifying that our abstract assumptions can be effectively checked in concrete 
cases like equation ([T|). 
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In our context the stochastic maximum principle is the statement that to any optimal pair 
{u,X) we can associate a pair of adjoint processes {p,q) in such a way that the Hamiltonian 
function, constructed by means of u, X and p, satisfies a maximum (or minimum) condition, 
see inequality m below. Our main results are two versions of the stochastic maximum 
principle, with slightly different assumptions. In the first one the adjoint processes (p, q) 
are defined and uniquely characterized using a duality argument. In the second one it is 
shown that (p, q) can also be characterized as the unique mild solution to a backward SPDE 
(BSPDE). While it is natural to expect that the first adjoint process p takes values in the 
dual space E' of E, it seems difficult to prove directly well-posedness of a BSPDE as an 
equation in E', due to the lack of an efficient stochastic calculus in this space. To avoid this 
problem we formulate the BSPDE as an equation in a bigger Hilbert space where we can 
use standard stochastic calculus and then we prove that the solution is indeed more regular. 
In fact, under suitable assumptions we prove that the first adjoint process p actually takes 
values in H', with a possible blow up when t ^ T. In the proof we make frequent use of 
duality arguments in order to obtain a priori estimates. 

We finally mention that developing the theory in the space E of continuous functions 
allows to consider fairly general cost functionals. For example we are able to formulate a 
control problem in which we optimize the evolution of the state evaluated at fixed points of 
the space, see (12611 . 

The plan of the paper is as follows. After some preliminary results on the well-posedness 
of the state equation and the formulation of the optimal control problem in Section 2, we 
state our main results. Theorems [Q and Ea in Section 3. Section 4 is devoted to some 
examples, where we show that our general results can indeed be applied to various concrete 
controlled SPDEs. In Section 5 we analyze the spike variation technique and in Section 6 
we prove Theorem 13.11 Finally, in Section 7 we study the BSPDE for the adjoint processes, 
which immediately leads to the proof of Theorem 13.21 


2 Notations, assumptions and preliminaries 

Let O C be an open bounded subset of with boundary dO of class C^. We denote by 
H the Hilbert space L^(0,M) with inner product (■> ')h and by E the Banach space (7(0, M) 
endowed with the supremum norm j-jg. Moreover we denote by (or simply 

the duality pairing in E' x E, where E' is the topological dual of E, and by h’ {'■:')h 
duality between H and H'. Given a probability space (D,T', P), by a cylindrical Wiener 
process we mean a family of linear mappings from K to L^(D) 

K3h^Wl^ € L2(D) 


such that the two following conditions hold 

1- W)t>o is a real continuous Wiener process, for every h G AT; 
2. • Wt^) = {h, k)j^, for every h,keK. 

Remark 2.1. We can also think the cylindrical Wiener process as 

OO 

W{t) = J2(^kWk{t) 

k=l 


where is a complete orthonormal system of K and Wk{t) are mutually independent real 
Brownian motions defined on (12,7^, P). It is worth noting that the series above does not 
converge in AT, but in any Hilbert space ATi D K such that the embedding is Hilbert-Schmidt 


see 


10| for a detailed exposure). 
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We use the natural filtration {Ft)t>o associated to W, augmented in the usual way with the 
family of P-null sets of J-, and we denote by V the progressive a-algebra on 17 x [0,T], for 
some T > 0. If i? is any Banach space, for any p > 1, A G M and T > 0 we define 

• B), the set of all J^^-measurable random variables Y with values in B such that 


Ljr{Q X [0,T];B), the set of all (J't)-progressive processes with values in B such that 

, f-T N 1/2 

\\^\\Lj,{nx[0,T]-,B) — {^ < oo; 


L^{Q,;C{[0,T]; B)) the set of all (J^t)-adapted continuous processes with values in B 
such that 

ll^llL5,(O;C([0,r];B)) = 

L^(17; L^([0, T], A; i?)), the set of all (J^t)-progressive processes with values in B such 
that 


lLG(O;L2([0,r],A;B)) 




< oo. 


Remark 2.2. If R is a Hilbert space then also the spaces Ljr{[0,T]] B) and ^^^([0, T], A; R) 
are Hilbert. Moreover, it is easy to see that 

[L3.([0,T],A;R)r = L3.([0,r],-A;R) 

where the duality is given by {x, y) = E (x(t),y(t)) dt. 

If K, H are Banach spaces we denote C{K, H) the space of linear bounded operators T from 
K to H, endowed with the usual operator norm ||T||£(^ We set C{K) := C{K,K). When 
K,H are Hilbert spaces we write C 2 {K,H) for the space of Hilbert-Schmidt operators from 
K to R, that is, linear operators T G C{K, H) such that 

igiiA(k,h) = < °° 

km 

where {ek)km is any orthonormal basis of K. We set C 2 {K) := C 2 {K,K). 

The space of control actions is a general separable metric space U endowed with its Borel 
fj-algebra B{U). A control process is a progressive process with values in U. We 

denote with U the space of admissible controls. 


2.1 First assumptions on the controlled state equation 

The aim of this work is to study a controlled stochastic partial differential equation of the 
form 

/ dX{t) = [AX{t) + F{X{t),u{t))] dt + BdW{t) 

\A(0)=x, 

and try to give some necessary conditions for the existence of a control process which minimize 
a cost functional of the following type 

J{u)=E [ L{t,X{t),uit))dt + EGiX{T)). (3) 

Jo 
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A control process u for which ([3|) attains its minimum is called optimal, i.e. 

J(h)= inf J(m(-)). 

u{-)eu 

u, together with its corresponding trajectory X, will be called an optimal pair {u,X). 

Let us give some assumptions on the equation we are considering. 

Hypothesis 2.1. 1. A : D{A) C E ^ E is a sectorial operator in E, whieh generates 

a contraction semigroup S{t), t > 0. Moreover, S{-) extends to a strongly continuous 
semigroup in H, still denoted with the same symbol. We assume that S{t){H) C E for 
t > 0, and for some constants C > 0 and A G [0,1) it holds that 

Il'^(^)ll£(j7,£;) — jx^ ^ ^ (^) 

2. B ^ C{K,H), S{t)B G C 2 {K,H) for almost every t G [0, T], 

J < °° (^) 

and the stochastic convolution 

WA{t):= f S{t- s)BdW{s), tG[0,r], 

Jo 

admits a modification with trajectories being continuous functions with values in E. 

3. X € E. 

Remark 2.3. 1. Since A is sectorial, it generates an analytic semigroup S{-) of bounded 

linear operators in E, non necessarily strongly continuous, and D{A) may not be dense 
in E (see e.g. 0). We additionally require that the semigroup is contractive, that is it 
satifies < 1 for every t > 0, and that it has an extension as required above. 

2. The property ([3]) is a form of ultracontractivity, with the additional quantitative re¬ 
quirement that A < 1. We will use this property in Proposition 16.11 

3. Under for every t G [0,T], the stochastic integral WA{t) is well defined as an 
equivalence class of random variables with values in H. We require that there exists 
an U-valued continuous stochastic process that, considered as a process in R D Li, is a 
modification of Wa- 

4. All the results that follow admit easy generalizations to the case when the drift F is 
time dependent or even stochastic. Similarly, the initial condition x & E might be 
replaced by a random variable. We keep the previous setting to simplify the notation. 

In order to state Theorems 13.11 and 13.21 below, we need to require that the semigroup is 
indeed more regular. To do it, we formulate two different hypotheses 

Hypothesis 2.2. There exists a Hilbert space V, continuously and densely embedded in E, 
and a constant (3 such that 

S{t){v)cv, \\s{t)\\^^y^<(3, tG(o,r]. 

Hypothesis 2.3. There exists a Hilbert space V, continuously and densely embedded in E, 
and a constant ft such that 

S{t){E)cV, l|5(i)ll£(i/)</3, tG(0,T]. 
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The second assumption is clearly more stringent. Note however that in both cases the 
bound concerns the norm in CiV). It will be clear later on (see Proposition 16.11 and Section 
7) which are the motivations for the introduction of the new space V. 

Regarding the cost functional, we need the following 

Hypothesis 2.4. i. The functionals L : 0. x [0,T] x E x U —>■ R and G : Q x E ^ M. 
are measurable with respeet to V ®B{E) ®B{U) and 0(R) (respectively Et ®B[E) and 
B{R)). 

2. For each {t,u) G [0,T] x U, the operators L and G are Frechet differentiable on E. 
Moreover there exist K >0, k > 0 such that, P-a.s. 

\L{t,x,u)\ + \DxL{t,x,u)\^ < K{1 + |x|^), t G [0,T], x ^ E, u ^ U, 

\D,^G{x)\^ < K{l + \x\'f), xeE. 

2.2 Assumptions on the drift as a Nemytskii operator 

Here we describe the non-linear term F in the state equation. We suppose that a function 
/ : R X t/ —> R is given and that F : E x U ^ E is defined by 

F{x, u)(0 = /(x(0, u), feO 

for any continuous map x : O —>■ R and u ^ U. Thus, E is the so-called Nemytskii operator 
associated to the real function /, on which we make the following assumptions. 

Hypothesis 2.5. 1. f : RxU ^ R is measurable. For every a gR the map u f{a,u) is 

continuous on U, and for every u G U the map a f {a, u) is continuously differentiable 
on R, with derivative denoted by f'{a,u). Moreover there exist C > 0, /c > 0 such that 

\f{<^E)\ + \f'{<y,u)\<C{l + \cr\^), aGR,uGU. (6) 

2. There exists f gR such that 

f'{cr, u) < (5, (T GR, u GU. 

If the map / satisfies the above assumption, the Nemytskii operator F cannot be defined 
as a map from H x U into H. Even if it could be defined, under more restrictive growth 
assumptions on /, it would fail to be differentiable in general: indeed, it is known that a 
Nemytskii operator T : FI ^ F[ is Frechet differentiable if and only if T is linear (see [2], 
page 20). However, as a map from E x U to E, F is well defined and has some regularity 
properties, as shown in the following 

Lemma 2.6. The functional F : E x U ^ E is Frechet differentiable on E. Moreover the 
Frechet differential DxF acts on every h G E as a multiplication operator, namely 

[DxF{x, u) • h] (0 = /'(x(0, n) • h{f), e € O. (7) 

Proof. Fix x,h G E, u G U. For G O, s 0 we have 

f{x{f) + sh{f,),u)-f{x{f),u)=[ f{x{C)+ 9sh{f,),u)sh{f,)d9. (8) 

20 
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Fix e > 0, let (^ > 0 be such that \f'{x{^) + y,u) — f'{x{^),u)\ < e, for \y\ < 6. Such a 6 
exists since f'{-,u) is continuous on a compact set containing the image of x(-). Now choose 
s small enough such that |0s/i(^)| < 5. Taking the limit we obtain 


sup -|/(x(0 + sh{C),u) - f{x{C),u) - f'{x{C),u) sh{^)\ 


i&o s 



ieo Jo 


< esup|/i(OI 


which proves (j7|l. 


□ 


To proceed further we need to recall the following 

Definition 2.1. A map g : D{g) C E ^ E is called dissipative if for all x,y € D{g) and 
a >0 it holds 

\x-y\E < \x-y-a{g{x) - g{y))\E- 
Equivalently, if there exists z* & d\x — y\^ such that 

e{ 9 {x) - g{y),z*)j^, ^ x,y e D{g). 

Here we denote by d\-\^ the sub-differential of the norm in E. 

In the special case in which E is a Hilbert space, this condition coincides with the mono¬ 
tonicity assumption {g{x) — g{y),x — y)^ < 0, for all x,y € D{g). 

We refer e.g. to for basic properties of dissipative mappings. We also take from 
0, page 180, the following two results on the properties of the operator F and its Yosida 
approximations. 

Lemma 2.7. Under Hvvothesis \2.5[ if F is the Nemytskii operator associated to f then there 
exists a constant c € M such that the following are true: 

(i) F{-,u) — cl is dissipative in E, for every u G U; 

(a) Ifx G E then sup„g[/||D 3 ,F(x,< c(l + |x|^); 

(Hi) If x,h ^ E then there exists 5h G d\h\^ such that sup^gf/ {DxF{x,u)h,5h) e ^ ^\^\e’ 

(iv) For x,h & E it holds sup^g^/ {DxF{x, u)h, h) ^ < c|/i|^. 

Lemma 2.8. For any a > 0, cr G M and u & U, consider the resolvent map Ja{cr,u) = 
(/ — af)~^{a, u) and define faio', u) = f{Ja{a, u),u) — cJa{a, u). If we denote Fa{x, u){^) = 
fa {x{f),u), then Fa is dissipative and Lipschitz-continuous both in H and in E. Moreover 


[DFaix,u)y]{(,) = faixif,),u) ■ y{f,), f,eO,u£U. 


and for any R > 0 we have 


lim sup \Fa{x,u) — F{x,u)\e = 0, u € U. 


Remark 2.4. Let us notice that the operators Fa defined above coincide with the usual 
Yosida approximations of F. We also note that from Q it follows that 


|/„((T,u)| + |/^(cr,n)| < ^(l + ct G M, n G [/. (9) 


(9) 


for some C > 0 independent of a. 
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2.3 The state equation 

Now we are in position to study the abstract form of the state equation ([2]). While ([5]) is 
merely a formal writing, the precise formulation of the state equation is a so-called mild 
formulation: an ill-valued continuous adapted process X{t) is a mild solution of the SPDE 
above if, P-a.s., 

X{t) = S{t)x + [ S{t- s)F{X{s),u{s))ds + WA{t), t € [0, T], 

Jo 

where Wa is the ill-continuous modification of fg S(t — s)BdW(s), t G [0,T]. Under the 
previous assumptions we can state the following theorem (compare [l0|). 

Theorem 2.9. Assume that hypotheses \2A\ and \2f^ hold true, then equation m admits a 
unique mild solution X. Moreover X G L^{^; ^([0, T]; iH)) for every p > 1. 


3 Statement of the main results 

Let V' be the dual of the Hilbert space V introduced in Hypothesis 12.21 Let us identify H 
with H' by the Riesz isometry. Then we obtain the following continous dense inclusions 

VcEcHcE'cV. 

Now we can state the two building blocks of our main result. 

Theorem 3.1. Suppose Hvvotheses \2.1\. Ig.gl IS.fl and 1 2., 51 hold, let r' G (1,2) he fixed arbi¬ 
trarily and let A he as in Hvvothesis \2. R Suppose that {u,X) is an optimal pair. Then there 
exists a progressive process p with values in H' satisfying 

^{.lo ^ 

and for which the following inequality holds P-o.s. for almost every t G [0, T]; 

'H{t,v,X{t),p{t)) —'H{t,u{t),X{t),p{t)) > 0, foreveryv£U, (10) 

where TLit, u, x,p) := L{t, x, u) h' (P; T(x, u))^^ is the Hamiltonian function of the system. 

We note that in Theorem [Q no characterization of p is given. It will be clear from the 
proof that p is the first component of a pair [p, q) which is uniquely determined by a duality 
argument. In Theorem 13.21 we give a more precise characterization of the pair (p, q) as the 
solution of an adjoint equation, which is backward in time. 

Theorem 3.2. Let {u,X) be an optimal pair. If Hvvotheses \2.R \2.3\. \2.4\ and \2.5[ hold and 
r’ G (1,2) then there exists a pair of progressive processes {p, q) with values in H' x L 2 {K, V') 
for which inequality (uni) holds. Moreover 

^\Io ^ 

and the pair (p, q) is the unique mild solution of the Backward SPDE (l53l) below. 

Remark 3.1. It is worth noting that Hypothesis 12.31 is used only in the proof of uniqueness 
of the Backward SPDE. For the existence part it is sufficient that the semigroup preserves 
V, i.e. Hypothesis 12.21 


4 Examples 


The aim of this section is to present some concrete examples of stochastic control problems 
for SPDEs which can be treated using our results and where the general assumptions stated 
above can be effectively cheked. We begin by giving some general sufficient conditions for the 
space-time continuity of the stochastic convolution, that we have assumed in Hypothesis 12.11 
2. Then we focus on the case in which the sectorial operator H is a realization of the Laplace 
operator with Dirichlet boundary conditions: we give examples of covariance operators B for 
which Hypothesis 12.11 2 can be readily verified and we show how the abstract Hilbert space 
V can be chosen. Last, we briefly discuss the form of the cost functionals in which we are 
more interested in. 

4.1 A general condition for the space-time continuity of WA{t) 

In the literature, most of the sufficient condition for space-time continuity of the stochastic 
convolution can only be applied to one-dimensional domains O C M and to few simple domains 
in higher dimensions. Here we prove some statements concerning more general domains in 
M”, n > 1, with boundary regular enough, mainly relying on the results from 0. Although 
several arguments are usual, we could not find any reference that includes the results we are 
going to prove. 

The hrst step in this direction is the study of the Ornstein-Uhlenbeck process associated 
with the stochastic equation 

dzt = Aztdt + BdWf. (11) 

The existence of mild solution W^it) in H to the above equation is easily obtained by imposing 
the following trace condition, for any t >0, 

l\\S{s)B\\^^^H^ds = l\v[S{s)BB*S{sr]ds < oo. (12) 

Under this assumption, WA{t) is a mean-square continuous Gaussian process with values in 
H and it is given by 

WA{t)= [ S{t- s)BdW{s). (13) 

Jo 

For our purposes, we need Wa to be more regular and we make the following 

Hypothesis 4.1. (i) For any p > 2, the semigroup S{t) extends uniquely to a strongly 

continuous semigroup in L^{0). 

(a) For all e G (0,1) and p > 2, there exist r >2 and K such that the following condition 
holds 

\S{t)\ W£.P(C)) < Vx€LP(0), 

where with we denoted the fractional Sobolev space. 

(in) There exists an orthonormal basis of H which diagonalize both A and B. That is, 
there exist two sequences of real positive numbers pk and real bk such that pk -boo 
as k ^ -boo and 

Aeu = -pkGk, Bek = bkek, A; G N; 

(iv) Each efc(-) is bounded. Let M > 0 and Ck be an increasing sequence, then it holds that 

|efc(OI < Mck, Cfc G N,^ G O; 
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(v) There exists a G (0, such that 

+ 00 

< 00 

k=l 

Now we are in position to state the following 

Theorem 4.2. Assume Hypotheses (i)-(v) hold. Then the stochastic convolution Wa '■ 
[0,+ 00 ) X O —>■ M is continuous, P-a.s. Moreover, if p >2 we have 

E sup \WAit)\^E < °° 
te[o,T] 

Before proving the theorem we recall a useful analytic lemma (cfr. [^, page 23). 

Lemma 4.3. Assume Hypotheses (i)-(ii) hold. Let T > 0, p > 2 and f G L^([0,T] x O). If 
we set ^ 

F{t)=[ S{t - a){t - a)'^~^f{a)da, te[0,T]; 

Jo 

then F G (^([OjT] x O) and there exists a constant Ct,p such that the following estimate hold 

sup|F(t,0r < C'T,p|/lip([o,r]xC>)- 

Proof of Theorem \4-2\ Using the factorization method (see e.g. 0 for a detailed exposition), 
we write WA{t) in the following form 

^u(i) =-—f S{t — a){t — ar~^Y{a)da, (14) 

Jo 

where ^ 

Y{a)= [ S{a - s){a - s)-'^BdWis), u > 0. (15) 

Jo 

If ^ G O, setting Y{a){^) =: y(cr, .^) we have 


^ per 

Y{a,C) = y,bk - s)-^ek{0dWk{s) 

k=i do 

which is a gaussian random variable with zero mean and covariance given by 

00 

fc=i -^0 

Thanks to Hypotheses (iv)-(v) we see that 

poo 

<M'^bl / e~^ i-2a ^kd^ ^ dM^blul^-^cl < 00. 

k=i do 2 //^ 

Therefore, from the gaussianity of y(-, •), there exists Cp > 0 such that 

E|y(a,or<c(E|y((T,oi')^/'<Cp 


(16) 


(17) 


and we have ^ 

E / / \Y{a,0fm 

Jo Jo 

The conclusion follows from Lemma 14.31 


{di)dt < TCpm{0). 


□ 
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4.2 The Laplace operator 

Let us choose A to be the realization of the Laplace operator with Dirichlet boundary 
conditions. If T is a suitable uniformly elliptic second order differential operator of negative 
type the same argument holds true with some modification. The state equation reads as 

A(i,e) = A^X(t,a+f(X(t,(),n(t)) + 

'X(o,0=^o(a 

^X(t,^)=0, t>0,^edO. 

In this case, the regularizing effect of the heat semigroup can be expressed by 

|S'(t)x|^ 2 ^ A 1)“2 (18) 

where 2 is the norm in the fractional Sobolev space IT®’^(0,M). The Sobolev embedding 
theorem assures that IT®’^(C>,M) (7(0, M) if s > d/2. Then we have that 

IS(t)xl^ < ce-^\t (19) 

In order to satisfy Hypothesis 12.11 we have to choose d < 3, therefore in this framework we 
are able to study stochastic equation of reaction-diffusion type only on bounded domains of 
dimensions 1,2 or 3. Thanks to the estimate m, a good choice for the Hilbert space V is 
the fractional Sobolev space VL^’^(C>, M), for which Hvpotheses l2.2l and l2.3l are easily satisfied. 
Hence we end up with a triplet of the form 

IT*’2(0,M) C (7(a,M) C l2(0,M). 


Concerning the stochastic convolution we want to present here an explicit com¬ 

putation which finalizes the Hypotheses 14.11 presented above. Assumptions (i)-(ii) are easily 
verified with the choice r = 2. The crucial hypothesis is the fourth one. Indeed the constant 
M in (iv) can depends on the domain O. According to a remark in 0 , the two basic do¬ 
mains one has to deal with are the square and the ball, which correspond to the best and 
worst case, respectively, as far as the growth of normalized eigenfunctions is concerned. Let 
us begin taking O = [0, the eigenfunctions of the Laplace operator are 

d 

■ ■-sinikdO, ^ e [0,7r]'^,\kf = kj + ... + kj. 


hence they are uniformly bounded from above by a constant, i.e. |efc(^)| < M, for all /c € N. 
On the other hand, for a general domain with smooth enough boundary, Grieser in 
produce an estimate of form 


i 6 i 


sup|efc(x)| < 
xeo 


It is worth noting that the bound given above is optimal in the case in which the domain is 
a ball, (7 = {x G R'^ : \x\ < 1}. In the following we will refer to it as to the “worst” case. 

In many applications the diffusion operator is written as power of A, more precisely 
B = (—A)“'>', for some 7 > 0. We are looking for some condition on 7 implying the continuity 
in space and time of the stochastic convolution. The general idea is the following: in order 
to have the space-time continuity it is sufficient to colour the noise a little bit more, in other 
words it is sufficient to increase enough the exponent 7 . To do that, we recall a result proved 
by Agmon, see [l|, and we follow for the verification of the trace condition. Let A^(/u) be 
the number of eigenvalues of —A not exceeding /r, then we have an asymptotic estimate of 
the following type 

N{^) = -b o(/r'^/^), for some c > 0 . ( 20 ) 
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The relation above is useful in defining the measure v on the real interval [^i,oo). Here /ii 
is the first (positive) eigenvalue such that 

= ^{k : Hk G D}, 


for every Borel subset D of [/ri,oo). It is obvious that = N{ij,). If we rewrite 

Hypothesis (v) in this setting we get, on one hand, 

OD 

( 21 ) 

k=l 

for the “worst” case in which the domain O is a ball. On the other hand. If O is a hypercube 
the condition is the following 

OO 

< oo, (22) 

k=l 

Let us consider first the worst case, we get 


E 2 , 

f^k 

k=l 


2a-l-2-y+{d-l)/2 




2 a-l- 27 +{d-l )/2 


v{d^) 


'Ml 


/*oo /*oo J 

/•OO /•OO 

J Uy J u^ 


(23) 


'Ml -'Ml 
r^oo 


= c 


I 


^ 2 a- 2 - 27 +(d-l)/ 2 ^^^^^g 


Ml 

OO 


/•OO 

<C s 2 a- 2 - 27 +(d-l)/ 2 +d/ 2 ^^^ 

J fll 


Where we used (l20]l in the last inequality. Obviously, Hypothesis (v) is satisfied if and only 
if the last integral is finite. That is, in terms of 7 , if and only if 


2d-3 

7 > —z -1“ ct- 


(24) 


Proceeding in the same way for O = [0, vr]'^ we obtain the following condition 

d-2 

7 > + a- (25) 

As an example, let us consider for simplicity the two dimensional case. Here we have 

• 7 > a if O = [0,7r] X [0, tt], 

• 7 > 1/4 + a if O = {x G : |x| < 1}, 

with a G (0,1/2). It is interesting to notice that in the authors obtained the condition 
7 > (i/4 — 1/2 (if d = 2, this imply 7 > 0) for the verification of the trace condition. To obtain 
the space-time continuity the condition become more restricitve. In the two dimensional case, 
one has to choose an exponent 7 for the diffusion term that is strictly greater than 1/4. 
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4.3 Cost functional 


One of the motivations for the study of optimal control problems in the space of continuous 
functions is the possibility to consider a very large class of cost functionals. Here we are 
particularly interested in costs of Nemytskii type of the following form 


J{u) =E r [ X{t, 0,u{t)) //(dO dt + E [ giX{T, 0 )) //(dO, ( 26 ) 

Jo Jo Jo 


where /r is a given measure on O C and I, h satisfy the following conditions 


1. The functions 


t,a,u) : H X [0, T] x M x [/ ^ R and <7(0;, a) : H x R ^ R 

are assumed to be measurable with respect ioV® H(R) ® B{U) and H(R) (respectively 
J'T®e(K) and H(R)). 

2. The maps a i-)- l{L 0 ,t,a,u) and r i->- g{uj,a) are C'^(R) and there exists K > 0, k > 0 
such that, P-a.s. 

sup sup \Dfjl{t,a,u)\ < K{1 + \a\^), 
ueu t£[o,T] 

\D^g{a)\ < K{1 + \a\^). 

The two important cases we can deal with are 

• fj-i = m is the Lebesgue measure; 

• g .2 = ^ linear combination of Dirac measures at points G O. 

Observe that a cost of Nemytskii type with g = g 2 reduces to a sum of pointwise evaluations 
which is well dehned in the Banach space E of continuous functions but not in the space H. 
In our case, the development of a theory of stochastic optimal control problems in a Banach 
framework allows to control the evolution of a stochastic reaction-diffusion equation only in 
a finite number of points. 


5 Spike variation method 

In this section we state preliminary results needed for the proof of Theorems 13.11 and 13.21 We 
use the classical approach based on spike variations (in time). Throughout this section we 
suppose that Hypotheses 12.1112.4112.51 hold true. 

Let us consider an optimal control u{t) and the corresponding optimal trajectory X{t). 
To shorten somehow the notation, in this section we write ut,Xt instead of u{t),X{t) and we 
use similar notation for other processes. Let E^ C [0, T] be a set of measure £ of the form 
[to, to + e], for some to € (0,T), then we can introduce the spike variation process 


u 


£ 

t 


ut, if t G [0,T] \ Lie 
wt, if t G Es, 


(27) 


for some admissible control process w. The perturbed trajectory is denoted by X^. We are 
going to construct a new process which is the mild solution to the following equation, 
known as first variation equation: 


jYf = AYf + D^F{Xt,ut)Yf + 6^ Ft 

Yi = 0 , 


( 28 ) 
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where we used the notation 5^Ft := F{Xt,Ut) — F{Xt,ut). More precisely we say that is 
a mild solution to the above equation if for all t > 0, P-a.s. we have 


Yt^ = 


[ S{t-s){D^F{Xs,Us)Y,^+ 5^Fs)ds. 
Jo 


Existence and uniqueness of a mild solution of equation (|28l) with values in E is well known, 
see e.g. for a detailed exposition. Let us recall that, according to Lemma 12.71 for any 
y & E 

{DF{Xt,ut)y,5y)j^<c\y\j^, P-a.s. (29) 


where 5y is an element of the subdifferential of the norm d\y\^ and c € M. We will use the 
following notation throughout the paper: 


\DF{Xt,ut)y\E<CF{uj)\y\E, P-a.s. (30) 

where Cf{uj) denotes a suitable random variable having finite moments of any order. 

Now we want to write the difference of the cost functional J{u^) — J{u) as a function of 
up to a negligible reminder. In order to do that we need some estimate of the difference 
X — X^ — Y^. More precisely we state the following 


Lemma 5.1. If we define If := Xf — Xt, and r]t •= ~ Yfi, then the following estimates 

hold, for k > 1, 

(i) Esupig[o,r]l^t Ie = 

(ii) Esuptg[o,r]l^/lE = 

(Hi) Esupig[o,r]l^flE = 


Proof, (i). Let us denote for brevity zt := Jq ^'>^QdW{s) and consider vt := Xt — zt, 
vf := Xf — Zt- Then 


vt= S{t - s)F{vs + Zs,Us)ds, 

Jo 


= / S{t — s)F{Vg + Zs,ul)ds. 

Jo 


Let solutions to the corresponding equations with S{-) replaced by Sn{-). Therefore 

^e,n yTi g^j,g solutions of the equations 


dt ' 


— AnVt + E(vf + Zs, Us), 


d n,e 
-rVt 

dt * 


= AnVt’"^ + F{v 


.71,£ 


+ ^s,ul) 


(31) 


where An = nA{n — A) ^ are the Yosida approximations of the operator A. Then, if 5t € 
— uflg, we have 


^ \vr - v?\f < {An {v^ - V?) + F{vr + ut) - F{vf + Ut), 5t)E 


< {F{vt' + Zt,ul) - F{vf + zt,ul),5t)E + {F{vf + Zt,u^) - F{vf + zt,ut),5t) ^ 

< c\vt'^ - + \F{v'^ + Zt,U^) - F{vf + Zt, Ut)\E, 


from the contraction property of the semigroup and the dissipativity of the operator F. 
Thanks to the Gronwall lemma we obtain 


I n,e n 
\V^ — V 


t \E — ^ 


Letting n —>■ oo 


f \F{vf + zt,u‘') - F{vt +zt,ut)\Eds. 
Jo 

Tie 

Jo 


\Vt -Vt\E < C 




(32) 
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Now recall that := Xf — Xt = vf — vt, using the polynomial growth of F we end up with 


\C!\e < c / \5^Fs\j^ds < CF{uj)e, 

JEe 

where Cf{uj) has finite moments of any order. Hence Esup^l^fl^ < k > 1. 

(ii). Let be the solution to the following equation 


A 

dt 


+ D^F{Xt, ut)Y,^'^ + 6^Ft. 


(33) 


Using the dissipativity assumptions we get 


dt 


K’If < {AnY^’ + D,F{Xt,ut)Y,^’^ + 5^Ft, 


t ' F 


<c\Yr\E + \S^Ft\E, 


then using the same strategy as before and passing to the limit with n —>■ oo we get the 
required result. 

{in). Now let us define t]^’^ := — vf — If 6t G we have 

^\nl'^\E < + F{vt'^ + zt, uf) - F(v)^ + zt, uf) 

- D^F{vt + Zt, ut){vt'^ - Vt) + D^F{vt + zt, ut)r]t’^, St)E 

< ( D,F (nr +zt+e {vr - vn, (^r - 

Jo 

- D^F{vt + Zt,ut){vt’^ - Vt),dt)E + c\r]t’^\E 

= ( [D,F (nr + zt + e {vr - V ?), ut) - D^F « + zt, )] - v^) dO, 5t)E 

Jo 

+ {[D^F{vt + Zt, ul) - D^F{vt + zt,ul)] {vr - O, dt)E 

+6^D,Ff{vr-vrrE+c\vr\F- 

Thanks to the Gronwall lemma we obtain 


t rl 


\rir\E < C 



'0 ^0 
rt 


+ c/ 

J E, 

Letting n —)■ 0 

WAe ^ C 

+ C 


DxF « + Zs + e {vr - Vt), r) - DxF « + Zs,ul) 
DxF{vs + Zs,ul) - DxF{vs + Zs,ul) 
d^DxFs 


C(E) 


vr - < 


C(E) 


vr-V^s E^^ 


C(E) 


vr-Vs Ed^ 


t rl 


0 JO 


DxF {vs + + 6^1, r) - DxF {Va + Zs,uf) 

C{E) 


S^DxFa 


C{E) 


ICt 


thanks to the continuity of the map DF{ ) : E J^{E). Now according to the estimate 
obtained in {i) and to the Holder inequality we obtain 


E sup 
te[o,T] 


T ri 



E 


0 ^0 


DxF{Vs + + O^a^ r) - DxF{Va + Za,ul) 


4fc 


1/2 


C{E) 


dOds 


+ ( E 



S^DxFa 


Ak \ 1/2 

d.s ) 

C{E) J 
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The continuity of the map DF{-) : E — >■ C{E) implies again that \DxF{vt + zt + — 

DxF{vt + -Zt)tends to zero if e ^ 0. Thanks to the polynomial growth of D^F we 
get the result. □ 

Now we can deal with the expansion of the cost. 

Proposition 5.2. We have the following 

J{u^) - J{u) =E r [S^Lt + DxL{t, Xt,ut)Yf] dt + E {DxG{Xt)Y^) + o(e), 

Jo 

where 5^Lt = L{t, Xt,uf) — L(t, Xt, ut). 

Proof. The difference between the two cost functionals reads as 

rT 
/o 

Let us begin rewriting the running cost part 


J(u") - J{u) =E [ [L{t, Xlul) - L{t, Xt, Ut)] dt + E [G{X^) - G{Xt)] 
Jo 


E 

= E 


= E 


[ [L{t,Xf,ut)-L{t,Xt,ut)]dt + E[ d^Ltdt 
Jo Jo 

[ [ DxL{t,Xt + e^f,ul)f,fdedt + E [ 5^Ltdt 

Jo Jo Jo 

[ C [DxL{t, Xt + e^t, ut) - DxL{t, Xt,ul)] de^tdt 
Jo Jo 

+ E [ DxL{t,Xt,ut)Qdt + E [ d^Ltdt 
Jo Jo 

C [DxL{t 
Jo 

+ E f DxL{t,Xt,ul)Yfdt + E f DxL{t, Xt,uf)r]fdt + E f 5^ Ltdt 
Jo Jo Jo 


< C E sup ICfll; 

V iG[0T] 

rT 


'0 

Xt + eet,ut)-DxL{t,Xt,ut)] dO 

T 


dt 


2\ 1/2 


<Gs- [E 


C [DxL{t, Xt + e^t, ut) - DxL{t, Xt,ut)] dO 
Jo 

2\ 1/2 


dt 


2\ 1/2 


+ (E( sup \T]t\%)^ 

t+0,T] 
rT 




\ \Jo 

+ E [ DxL{t,Xt,ut)Yfdt + E [ S^Ltdt 
Jo Jo 


DxL{t,Xt,ut)dt 
T 

0 


= E / DxL{t,Xt,ut)Yfdt + E / 5^Ltdt + o{£), 
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where we used the estimates given in Lemma r5.ll the continuity of the map DL{-) : E E{E) 
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and the polynomila growth. Regarding the second term we have 


E [G{x^) - g{Xt)] = E / d^g{Xt + ei^)de • 


= E / [D,G{Xt + - D^G{Xt)] + E {D^G{XT)i^) 


< Ce ■ E 


[D,G{Xt + ei^) - D,G{Xt)] dO 


2\ 1/2 


+ E(D,G(XT)y#)+E sup \r^?r\j,D^G{Xt) 

tG[o,r] 

= Ep,G(XT)y#) + o(e), 


where we used again the continuity of the map DG{ ) : E —>■ along with Lemma l5.ll 

By adding the two terms we conclude the proof. □ 

Remark 5.1. In the particular case in which the cost is given by dSl) the expansion of the 
cost reads as 

J{u^) - J{u) = E r / [l{t, XtiO,ul) - l{t, XtiO,ut) + DJ{t, Xt{f),ut)Yf{()] f,{dOdt 

Jo Jo 

+ E [ Dg{XT{0)YS{0Kd0 + o{e). 

Jo 


6 Proof of Theorem 13.1 

The central idea in this proof is the construction of a pair of processes (p, q) by a duality 
argument. We introduce the following auxiliary equation: 

{dy{t) = [Ay{t) + D^F{X{t),u{t))y{t) +-i{t)]dt + r}{t)dW{t) . . 

\y{0) = 0, 

which is a generalized version of the first variation equation introduced before. We want to 
study the linear map 

T ■ (7(-),d(-))'—^ (y(-),y(d')), 

which assigns to the forcing terms 7 (■),'/](■) the solution y(-) and its terminal value y{T). For 
an appropriate choice of functional spaces, r turns out to be well defined and continuous, 
and so is its adjoint 

The pair we are looking for is defined as {p,q) = T*{f,f) when we choose f = DxH{X{T))* 
and f{t) = DxL{t, X{t),u{t))*. The following Proposition will allow to identify the appro¬ 
priate norms for this duality argument. 

Proposition 6.1. Let Hvvotheses {KT[\27A and \2. 5 \ he in force. Suppose we are given r > 2 and 
two processes 7 , 7 such that 7 € Ljr{Ll; L^([0, T], — A; H)) and rj € Ljr{Q-, L^([0, T]; C 2 {K, P))). 

1. The equation (IMI) has a unique mild solution, i.e. a progressive E-valued process y such 
that, P-a.s., t y{t) belongs to L‘^{[0,T]; E) and 

y{t)=[ S{t-s)[DxF{X{s),u{s))y{s)+'y{s))ds+ [ S{t - s)r]{s)dW{s), (35) 

Jo Jo 

for almost every t € [0,T]. 
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2. The following estimate hold: 


E 


fm\ 

JO 


%dt < C 


E 


fT \'-/2 / i-T 

/ |7(s)|^(T-5)-"ds +E 
'0 / \J0 


Us)\\ 


L2{K,V) 


ds 


r/2 


(36) 

If the forcing term 7 (-) belongs to L^{Il-,C{[0,T]-,H)) then the solution is bounded in 
LP‘{H-,E), more precisely the following estimate holds for every t G [0,T]; 


nyit)\l<c 


E 


|7(s)|^(t-s) ^ds'] +^( [ Ms)\\L2iK,v)ds 


(37) 


Proof. Existence and uniqueness. Let us denote 

T{t) := Da;F{X{t),u{t)), T{t) := [ S{t-s)'y{s)ds, Wr,{t) := [ S{t-s)r]{s)dW{s). 

Jo Jo 

We note for further use that, for P-almost every uj £ H, supi^jo^T] \\T{t)\\^E) is bounded by 
a constant Cf{ui). Then equation ([351) can be written: P-a.s., 


y{t) = / S'(t — s)T(s)?/(s)(is + r(t) + VLr,(t) for almost every t G [0,T]. 
Jo 

Now we want to prove that T and Wrj belong to E). 


(38) 


f [ S{t - s)'y{s)ds 
Jo Jo 


E 


dt < E y | 5 (t — 5)7(5 )dt 

Jo (^J^\'yis)\Hit-sy^ds'^ dt 

CE J (^J \7{s)\1j{t — s)~^ds ■ J {t — s)~^ds^ dt 

C'E / (^J \-f{s)\]j{t - s)~^ds'^ dt 

= C'E^ 17(5)1^ {t-s)~^dt^ds 

= CE r\j{s)\liT-s)^-ys 
Jo 


< 


< 


< 


(39) 


< 00 


thanks to the fact that A < 1 and 7 G y{[0,T],—X; H)) C L'^{[0,T]; H)). Regarding 

the stochastic convolution we have 


E 


[ [ S{t - s)ri{s)dWs 

Jo Jo 


dt < CE 


[ S{t - s)r]{s)dWs 
Jo 

fT 

< c'lE y^ Ms)\\l^^j^yys < 00 


dt 


V 


(40) 


where we have used the Ito isometry for the stochastic integral in the Hilbert space V with 
respect to cylindrical noise, as well as Hypothesis 12.21 If we define 

T(t) = y{t) - r(t) - wyt), 

then equation (f35]l is equivalent to the following: P-a.s., 

v{t) = f S{t — s)T{s)v{s)ds + f S{t — s)T{s){r{s) + Wrj{s))ds (41) 

Jo Jo 
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for almost every t € [0,T]. Recalling that sup^gjQ j^] ||r(t)||£{£;) < Cf{uj) we see that P- 
a.s. s 1 -^ T(s)(r(s) + Wr,(s)) belongs to L^([0,T];£') and then it can be proved by an easy 
contraction argument that there exists a unique pathwise solution v{uj,-) G L‘^{[0,T]] E) (in 
fact, v{uj,-) € C{[0,T]; E)) and that, P-a.s., equality (HD) holds for every t G [0, T], 

By the same arguments, uniqueness of the pathwise solution v implies uniqueness for the 
original equation ([35]). 

Estimates. Let us now define the Yosida approximations An = nA{n — A)~^ of A and 
consider the approximating equations 

v'n{t) = AnVnit) + T{t)Vn{t) + T{t){T{t) + VR,(t)), u(0) = 0. (42) 

Then, for any t G [0,T], we have P-a.s. 


^ \Vn{t)\E < {AnVn{t),5n{t))^ + {T{t){Vn{t) + T{t) + W,j{t)), 6n{t)) ^ (43) 

where 6n{t) G (9|uri(t)|g. Using the dissipativity properties proved in Lemma [2.71 we have 
{T(t)vn{t),6n{t))E < cvn{t)^ and from the contraction property of the semigroup it follows 
that (AnVnit), 6n{t))^ < 0. So we obtain 


dt 


\Vn{t)\E < CVnit)E + Cf (w) | (r(t) + W^it))\^, P-a.S. 


(44) 


where the constant Ce{oj) is finite for a.e. w G 12. Using Gronwall’s lemma 


Me < CM fms) + WM))\Eds, P-a.s. 
Jo 


(45) 


Now, it is easy to prove that Vnit) —>■ v{t) in E so we can take the limit as n goes to infinity 
and we obtain 


\vit)\j^<Ciuj) / |(r(s)+ fU,,(s))|^ds. 


Recalling the definition of u(t) we get 


\y{t)\j, < CM fms) + WM))\Eds + |r(t)|g + \WM)\i 
Jo 


<C(a;)(|r(t)|^ + |lU,(t)|^), P-a.s. 
To prove estimate (l36l) we have to consider 


IE / |?/(t)||dt<E 


CMM i\m\E + \wME?dt 


< 


E(C(a;) 


2r’ 




E 


/ 


< K 


E{ \Tit)\l + \WM)\ldt 


E + \WM)\E)dt 

- 1/r 


1/r 


(46) 


(47) 


where we chose r > 2 and we used the finiteness of the moments of Ciu;). Using the same 
computations as in (f3^ and (|in]l we end up with 



< C 


E 
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which is exactly what we looked for. 

Assuming now that 7 G (^([OjT]; H)) then we have 


|r(t)||= [ S{t-s)j{s)di 

Jo 


< 


rt ^ 2 

\S{t - s)7(s)|^ds 


'0 


rt N 2 

<c{ I - s)-^ds 


'0 


< C 


< c 


[ l 7 ('S)|^(t - s) ^ds- [ (t-s) ^ds 
Jo Jo 

[ l7('S)l^(i 

Jo 


— s) ^ds 


thanks to the fact that A < 1. Regarding the stochastic convolution we have 


nWrj{t)\l = E [ S{t-s)r,{s)dW{s 
Jo 


<C-E 


<C-E 


[ S{t-s)r]{s)dW{ 
Jo 

fus)\\ 

Jo 


\L2{Ky) 


Then we have 


E|2/(t)||<E(c(a;)2(|r(t)|| + |r?(t)||)) 


< 


E{C{uj 


,2r' 


oiiA' r 


< K 


E 


E(|r(t)|| + |lT,(t)|| 

r-\ l/r 


r-| l/r 


E + \Wr,{t)VE 


The inequality ()37p now follows immediately. □ 

By ProDosition l 6 .ll r is a bounded linear operator from the space Ljr{Q; T^([0, T], — A; H))x 
Lj-{Q; L‘^{[0,T]] L 2 {K,V))) to x [0,T];E) x E). Its dual operator t* is also a 

bounded linear map 

r* : X [0, T]-EyxL%{n-,Ey^r;{n-,L^i[0,T],-X-,H'))x ([0, T]; L 2 {K, V))) 

where 1/r + l/r' = 1. We have used the fact that ^ H)]' = L‘^{Q x 

[0, T], —A; i7') and that the dual of L 2 {K,V) can be identified with L 2 {K,V') (see e.g. [^, 
page 291). 

Remark 6.1. If R is a separable and reflexive Banach space, then the dual of L3r(nx [0, T]; B) 
is X [0, r]; B') (cfr. e.g. 0). In our case, E is the space of real continuous functions, so 

this result no longer holds. However it is still true that L‘^{Q x [0, T]; B') C L‘^{Q x [0, T]; B)'. 
For our purposes, this is sufficient because we need only to evaluate r* at / and C which are 
much more regular. 

Remark 6.2. A priori it could be possible to choose 7 with values in E' instead of H. 
However this would introduce [Lj-(n; L^([0, T]; B))] , which is difficult to treat. 

We note that by definition the following equality holds 


E 


j w {p{t),'lit))Hdt + E J {q{t),r]{t))^^^Ky^dt = E J {f{t),y{t))Edt + E{C,yiT))E, 

(48) 

where {p,q) = and (y(-),y(T)) are the solution process and its terminal value of 

equation (|55]) corresponding to forcing terms ( 7 ,??). 
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Proof of Theorew, \S.l\. Since u is an optimal control we have J{u^) — J{u) > 0. Thanks to 
the estimates given in Lemma l5.11 we already obtained in Proposition 15.21 that 

o{e)<E[ [6^L{t) + D^L{t,X{t),uit))Yf]dt + E{D^G{XT)Yf{x)). (49) 

Jo 

Now we use duality (more precisely we use equation (|48l) with r/ = 0) for the first variation 
equation (|28l) . which reads as 

E {D,G{XiT)),Y%T))E = E r [h> {p{t), S^Fit))^ - {D,L{p X(t), u(t)), ^^(t))^] dt. (50) 

JO 

Here we have to check that 5'^F € L5r(ll; L^(([0, T], —A; Lf)) but this is true thanks to Hy¬ 
pothesis 12.51 Substituting in ()4^ we get 

1 rto+e 

o(l) <-E / [H{t,X(t),u^{t),p{t))—'H{t,X{t),u{t),p{t))]dt. (51) 

^ Jto 

Now the proof can be concluded by usual arguments, see for instance (26l |. 

□ 


7 The adjoint equation and the proof of Theorem 13.2 


In this section we study a backward stochastic partial differential equation (BSPDE) and we 
characterize the adjoint processes {p, q) as its unique solution. This leads immediately to the 
proof of Theorem 13.21 While the process p takes values in E', it is not easy to formulate 
the BSPDE as an equation in this space, since the usual tools of stochastic calculus are not 
available there, in particular there is no version of the martingale representation theorem in 
E'. We will therefore embed E' in a bigger Hilbert space in which we can use many standard 
techniques and we can uniquely solve the BSPDE in a mild formulation. Then we will prove 
that the solution is indeed more regular and in particular that p takes values in E' (and even 
in H') as desired. 

Recall that if we identify H ~ Lf', we can embed the Gelfand triple E C H' C E' in a 
Hilbertian triple V d H' dV and we obtain the following dense continuous inclusions 

V d E d H ^ H'd E' dV', 


where now we suppose that Hypothesis 12.31 holds true. We first write the dual BSPDE in a 
formal way as follows 


(-dp{t) = [A'p(t) + D^E{X(t),u{t)yp{t) + f(t))]dt - q(t)dW{t) 

\piT) = C, 


where f{t) := DxL{t, X{t),u{t))* and f := DxG{X{T))* take values in E'. This equation is 
given the following precise meaning in a mild formulation: for all t € [0,T], P-a.s. 


p{t) =S{T - tyc + S{s- ty [D,E{X{s),u{s)rp{s) + f{s)] ds 


S{s-t)^q{s)dW{s), 


(53) 


where 5(t)' : E' ^ H' is the adjoint of S{t) : H ^ E (compare Hypothesis 12.11 11 and 
S{t)^ -.V'^V' is the adjoint of S{t) viewed as an operator from V to V, which is possible 
thanks to Hypothesis 12.21 On the other hand, T* : E' ^ E' denotes the dual of any bounded 
linear operator T : E ^ E. Note that equality (|53p has a meaning in the space V\ in which 
the stochastic convolution takes values. 
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Remark 7.1. Notice that, with the previous notation, the adjoint of S{t) : H ^ H (i.e., 
S{t) viewed as an operator on H rather than from H to E) coincides with S{ty\H' : H' —)• H', 
the restriction of 5(t)' : E' ^ H' to H' C E'. 

Remark 7.2. Notice that the BSPDE is linear inp, but the map i—>• DxE{X{t),u{t))* : 

E' —)• E' is not bounded. Using the Nemytskii characterization it is easy to see that DxE{x, u)' 
acts on measures as multiplication by a density. Indeed, for any v £ E 

{D^E{x,u)*p,v)e = {p,D^E{x,u)v)e 

= f'{x{^),u)v{0dp{0 ("54^ 

= / viO ■ f'ix{0,u)dp{C) 

Jo 

which means that it maps p into f'{x, u)p. It is worth noting that also the terminal condition 
lives in E', as well as the forcing term DxL{t, X{t),u{t))*. 

Now we can state the following 

Theorem 7.1. Let Hvvotheses lg.il lg.,91 13-41 hold and r' € (1,2). Then there exists a 
unique mild solution {p,q) £ Ljr{Cl] L‘^{{[0,T], X] H')) x Ljr{Cl] C 2 {K,V'))) to the 

BSPDE (p]l . 

Proof. Existence. 

STEP 1: (Regularization) We shall construct a solution to equation (IMl) by means of an 
approximation technique. The main tool we use is the smoothing effect of the semigroup 
S{ty : E' ^ H'. If we define 

:= S{eyc, fit) := 5(e)7(t), 


then both f,f^it) take values in H' , for all e > 0. We also introduce the Yosida approx¬ 
imations fa of /. Thanks to Lemma 12.81 we know that the associated Nemytskii operator 
Fa{x,u) is Lipschitz in x with respect to the norm of PJ, hence \\DxEa\\^jj'j is uniformly 
bounded by some costant ka which depends only on a. If we set Tait) := DxFa{Xit),u{t)) 
then its adjoint Tait)* is a bounded process with values in T(R'). Next we introduce an 
approximate BSPDE that we first write in a formal way as 

r -dp^’-it) = [A'p^’ft) + Tait)*p^’ft) + fit))] dt - q^’-it)dWit), 

\p^’»iT) = f. 

By the result in (l^ there exists a unique mild solution to this equation, i.e. a process 
(p^’“,g^’“) € L%iD X [0,T],H') x L^ffTfC^iK^H')), 


such that for all t > 0, P-a.s 


p^’“(f) = SiT-tyf+ r Sis-ty[Tais)*p^'fs) + fis)]ds+ r Sis-tyq^^fs)dWis). 
Jt Jt 


(56) 


Our aim now is to prove a uniform estimate for the approximate solution via a duality 
argument. 

STEP 2: (Duality) For all 7 € L^(D; C([0, T], R)) and i] £ L^(D; L2([0, T],/: 2 (-Y, U))) let us 
consider the equation 

ds + f Sit — s)rjis)dWis)., fG[0,T], 

Jo 


y^it)= Sit - s)[Tais)y‘^is)+jis)] 
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which has a unique mild solution y“(t) in the sense of Proposition 16.11 For all n € N, let 
An = nA{n — A)~^ denote the Yosida approximations of the operator A and Sn{t) := 

Then, in a similar way, we can find a solution to 

Vnit) = [ Sn{t - s 
Jo 

Again by 0 there exists a unique solution to the equation 

pX{t) = Sn{T-tyC+ r SniT-ty[T^{srpT{s) + ris)]ds+ r Sn{T-tyq^’^{s)dW{s), 

Jt Jt 

in the space x [0,T],H') x L'^{[0,T], C 2 {K, H')). Notice that, since An are bounded 

operators, the equations can now be written in the stronger form 

fdyyit) = [Anvyit] +Tait)yy{t) +'y{t)]dt + r]{t)dW{t), 

hm = 0 ; 


)[7'a(s)yn('S)+ 7('S)]ds + / Sn{t - s)rj{s)dW(s). 

Jo 


r-dpn’“(t) = + f%t)]dt - q^’°^{t)dW{t), 

lPn’“(T) = C, 


where A* : H' ^ H' is understood as the adjoint of An '■ H ^ H. Computing the Ito 
differential d{H{yy(t),pn°‘(t))jj,) and letting n —>■ oo we get by standard arguments (see e.g. 

3): 


= E rH' (r(t), dt + Eh' (C^ 2/“(r))H. 
Jo 

Since, by Proposition 16.11 ?/“(f) takes values in E we also have 


(57) 


^ jo H' {p"’‘^it),-f{t))j^dt + E {q"’‘^{t),yit))c2iK,H)dt 

= E r E' {r{t),y<^{t))Edt + Ee> (C^ 2 /“(T))e. 

Jo 


(58) 


Let us now dehne the set A := {7 G L5r(0; L2([0, T],-A, iL)) : ||7|Il7(0;L2([o,t]-A,//)) ^ 1} 
and A := {7 G L>(L!; ^^([O, T],-A, F)) n L>(L!; C([0, T], 77)) : \hhr^^n;LHlo,Tl-x,H)) < IJ- 
It easy to see that A is densely embedded in A. If we take 7 = 0 in (l58]l we get 


E 


< sup 




72\ 


7gT L Jo 


E {r{t),y^{t))Edt + E{C,y^iT))E 


= sup 


E {r{t),y^{t))Edt + E{C,y^iT))E 


■jGA L -'0 


< sup 
7 gT 

+ sup 
7 gA L 


< C 


rT \ 

P£/+m 2 - ' 


1/2 


E Ifit^Edt E ly’^ityut 


rT \ 

,,a/j.\\2 - ' 


1 / 2 - 


(e|C 1|,) (E|y“(T)| 

l-T \ 1/2 

E j \nt)\i,dtj + 


1 / 2 ' 


1/2 
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where we used Holder’s inequality and the estimates pHIl and (IHTll with t = T of Proposition 
16.11 for the process Then, noting that |/^|g/ < |/|g/ and ICI^;/ < ICIeo we obtain 


E 




r 72 ' 


1/r' 


< c 


E 


f 


\f{t)\l,dt + E\C\% 


1/2 


where C does not depend on e and a. Let us now look for a similar estimate on We 

first recall that if {ek)k£n is a complete orthonormal system of H such that G P for all 
/c G N, we have 


k 

= '^V'{(f'°‘{t)ek,ri{t)ek)v (^ 9 ) 

k 

= C2(Ky'){<f'°'{t)^'n{t))c2(K,V)- 

Hence we can identify C 2 {K,Vy with the space C 2 {K,V'): for a detailed proof see [^, 
page 291. Writing now ([5^ with 7 = 0 , letting H := {7 G L‘^{[0,T], C 2 {K,V))) : 

{n-L^dOT] C 2 (kv))) — recalling the estimate pHIl and with t = T we get 




'/2\ IP' 


< sup 
rieB L 


< sup 
veB 

+ sup 
n&B L 


r E'{nt),y‘-{t))Edt + EEyC,y^{T)) 
Jo 

Ej^\f{t)\l,d^ ' (E£\y^{t)\ldt^ 


E 

1/2 


< c 


E 


l \fit)\Ut] + (e|C||,) 


X 1/2 


Summarizing, we have obtained the following estimate 


\\f 


\Ly{n-,L2{[0,T],\,H')) 


+ Ik 


lL5l(n;L2([0,T],£2(foV''))) - 


< c 


E 


r\f{t)\i,dt+m% 

Jo 


• (60) 


STEP 3: (Convergence) By pUjl there exists a sequence {ek,ak) (0,0), as k ^ 00 , such 
that (^p^k,ak^q^k,a!k'j =. converges to a limit (p, q) weakly in the product space. Setting 

for brevity T{t) = DxF{X{t),u{t)), we will show that {p,q) is a mild solution to equation 
(j53p . which we now write in the form 


p{t) = S{T-tyC + S{s-ty [T{t)*p{s) + f{s)]ds - S{s-tyq{s)dW{s). (61) 

First we have S{T — tyC^’^ S(T — ty( weakly in H' since, for any h ^ H, 


{S{T - tyC\h)^ = E' (C, S{ek)S{T - t)h)j, ^ E' (C, S{T - t)h)j, = h' {S{T - t)'C,. 
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Then we address the drift term. We take h ^ H and we have 


e/ h'{S{s- tyTa^{s)*pk{s),h)jjds = E h'{T ak{s)*Pk{s), S{s - t)h)ds 


= E^ (Pfc(s), [7hfc(s) -T{s)]S{s -t)h)j^ds 
+ E^ H' {pk{s),T{s)S{s -t)h)jjds. 


Let us consider the first term on the right hand side: 

rT 


E 


H' {pk{s), [Ta^is) - T(s)] S{s - t)h)jjds 

((T - (T - s)-V2 (,) _ T(,)] S{s - t)h)^ ds 


= E 


< E 


E 


r-T X r'/2\ 

(T- s)^\pk{s)yds 


{T-s)-^\[T^,{s)-T{s)]S{s-t)h\lds 


<K-{E( I iT-s)-^\[T^,{s)-T{s)]S{s-t)h\i,ds 


r/2\ lA 


r/2\ lA 


(62) 


Since S{s — t)h € E (for s t) then |Taj,(s)5'(s — t)h — T{s)S{s — t)h\^ 0 for all w € 0, 

s € [0,T]. Next we have 


\T{s)S{s-t)h\]^ = [ \f'{X{s,^),u{s))[S{s-t)h]{^)ym{dx) 

Jo 

< ^sup|/'(X(s,0,w('S))|^ \S{s-t)h\]j 

< Cf{u 2) ■ \h\^. 


where we have used the polynomial growth condition Q on f and Cf{uj) was introduced 
in Using ([9]) instead of ([6|) we prove by similar passages that |Tq,(s)S'(s — t ) h \\ < 

Cf{oj) ■ Thanks to the assumption that A < 1 the right-hand side of (1621) then tends to 
zero by dominated convergence. 

To prove that E f' {Pk{s),T{s)S{s — t)h)F ds ^ E h' {p{s)',T{s)S{s — t)h)^ ds it is 
enough to notice that s —> T(s)iS'(s — t)hls>t lies in the space L'jr{Q-]Lp‘{\Q,T],—\,H)) and 
to use the weak convergence property of pk- 

Finally, regarding the stochastic convolution it is enough to notice that the map 

qi —)■ J S{s — t)""q{s)dW{s) 

is linear and continuous from L'jr{Q] L'^{[0,T]; C 2 {K,V'))) to (U,U'), hence weakly con¬ 
tinuous. Indeed, for fixed t, let us dehne: 

Hq) ■= [ {S{sqis)dW{s) 

Jo 
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Then, using Burkholder-Davis-Gundy inequality we get 


(^£\S{s - tri{s>t}q{s)\%^K,V')ds^ ' ^ 

< • \\q\\L^'{n-,L^{[0,T]-,C2{K,V'))) < 

thanks to Hypothesis 12.31 Then the existence part follows. 


/2\ lA' 


Uniqueness. 

The BSPDE (j53p is linear, then it is enough to prove that if the forcing term /(t) and the 
final condition are zero then also the solution is identically zero. Hence we are dealing 
with the following 


S{s — tyT{s)*p{s)ds — j S{s — t)'q{s)dW(s). 

We apply the operator 5(l/n)' to both sides and setting p" := S^l/nYp, := 
obtain 



p^{t) = [ S{s — t)'S{l/n)'T{s)*p{s)ds — f S{s — tyq'^{s)dW{s). 

Jt Jt 


(64) 

S{l/nyq we 

(65) 


Notice that 


\S{l/nyT{srp{s)\^, < |A(l/n)'| 

< \S{'i/ny\^^i \'^{s)\c{e)\p(^)\e' 

<Cn{i + \xy>y,)\p{s)\H, 

< C'n(w)|p(s)|^,, P-a.s. 


Moreover we have the following integrability condition: for e > 9, 

\p{s)\fj,dt < oo, P — a.s. ( 66 ) 

thanks to the fact that p € T5r(H; T^([0, T], A; H')). Notice that (f 66 ]l fails in general for e = 0. 
Next, by the semigroup property, S{l/n) = 5(l/2n)S'(l/2n). By Hypothesis 12.11 1 we have 
S{l/2n){H) C E. Now we exploit Hypothesis 12.31 to deduce that S'(l/2n)(£') C V and we 
conclude that S{l/n){H) C V and even that S{l/n) € C{H, V), by the closed graph theorem. 
It follows that q‘^{s) G C 2 {K,H') and we have the estimate 



\q'^i^)\c2(K,H') — \^i^/''^y\c(V',H')\qi^)\c2iKy') — ^n\qis)\jc,2{Ky')^ ( 67 ) 

which guarantees that the stochastic convolution in (I65p is well defined. We write equation 
()65p on the time interval [0, T — e] obtaining 

fT-e pT-e 

p^{t) = S{T -e-tycy + / S{s-tyS{l/nyT{syp{s)ds- Sis-tyq^{s)dW{s), 

Jt Jt 

where (JJ := p'^{T — e) is the value of the solution at time T — e. Next we fix an arbitrary 7 
in the space Ljr{Q x [0,T],H) and dehne 

y{t)=f S{t - s)[T{s)y{s)+ -f{s)]ds. 

Jo 

Note that by Proposition 16.1 I t; is the solution to the equation ([5^ with 7 = 0. 
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Replacing the operator A with the Yosida approximations we call the so¬ 

lutions to the corresponding equations. We are in a position to apply the ltd formula to 
(p™’”', thanks to the integrability conditions obtained in (1661) and (1671) . Passing to the 
limit as m —)■ oo and taking the expectation we obtain, see for the details, 

E [ H' {p"'it),'y{t))Hdt + E f [h' {p^,T{t)y{t))H - H' {S{l/nyT{t)*p{t),y{t))H]dt 
Jo Jo 

= Eh> {y{T-e),C)H- 

( 68 ) 


Now we pass to the limit letting n ^ oo. We have 


E 

E 



H' 


H' 


[T-e 

^E / w {p{t),j{t))Hdt, 

Jo 

{p'^{t),T{t)y{t))jjdt ^E [ H' {p{t),T{t)y{t))jjdt, 

Jo 


(69) 


by dominated convergence, due to the fact that p(t) takes values in H'. We also have 


e / e' {T{t)*p{t),y{t))j^^E [ E' {T{t)*p{t),S{l/n)y{t))j^ dt 
Jo Jo 

Indeed, from the analyticity of the semigroup in E, see [^, it follows that almost all paths of 
the process y take values in D{A) (the closure of the domain of A in E), so that S{l/n)y{t) —> 
y{t) in E and the desired conclusion follows from the dominated convergence theorem. So 
from (|U5|) . letting n —)• oo, it follows that 


rT-e 

E/ {p{t),j{t))dt = E{y{T - e),p{T - e)). (70) 

Jo 


Setting t = T — e in (IMl) and taking the conditional expectation given Et-e we obtain 
p{T — e) = E {s)*p{s)ds \ Et-e] so that 


E 


/ yp{t),l{t))dt = E{y{T-e),E[l T{sYp{s)ds \ Et-e\) 
Jo Jt-e 


= E 


IT-e 


{y{T-e),T{s)*p{s))ds 


(71) 


which converges to 0 if e ^ 0. We finally obtain E {p{t),'y{t)) dt = 0 and we get the result 
by the arbitrariness of 7. □ 


We immediately arrive at the following conclusion. 

Proof of Theorem AS.Si Thanks to Theorem l7.ll there exist a unique process p{-) which is the 
first component of the solution to the BSPDE (|53p . The conclusion follows using Theorem 

EH □ 
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